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CONCURRENT REMEMBERED-SET INSERTION IN A 
GENERATION MANAGED BY THE TRAIN ALGORITHM 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention is directed to memory management. It particularly con- 
cems what has come to be known as "garbage collection." 

Background Information 

In the field of computer systems, considerable effort has been expended on the 
task of allocating memory to data objects. For the purposes of this discussion, the term 
object refers to a data structure represented in a computer system's memory. Other terms 
sometimes used for the same concept are record and structure. An object may be identi- 
fied by a reference, a relatively small amount of information that can be used to access 
the object. A reference can be represented as a "pointer" or a "machine address," which 
may require, for instance, only sixteen, thirty-two, or sixty-four bits of information, al- 
though there are other ways to represent a reference. 

In some systems, which are usually known as "object oriented," objects may have 
associated methods, which are routines that can be invoked by reference to the object. 
They also may belong to a class, which is an organizational entity that may contain 
method code or other information shared by all objects belonging to that class. In the 
discussion that follows, though, the term object will not be limited to such structures; it 
will additionally include structures with which methods and classes are not associated. 

The invention to be described below is applicable to systems that allocate memory 
to objects dynamically. Not all systems employ dynamic allocation. In some computer 

1 

H:\l 12\047\0087\PROSECUTVPATAPP.doc 08/20/03 2:29 PM 



PATENT 
112047-0070 



The invention to be described below is applicable to systems that allocate memory 
to objects dynamically. Not all systems employ dynamic allocation. In some computer 
languages, source programs must be so written that all objects to which the program's 
variables refer are bound to storage locations at compile time. This storage-allocation 
approach, sometimes referred to as "static allocation," is the policy traditionally used by 
the Fortran programming language, for example. 

Even for compilers that are thought of as allocatmg objects only statically, of 
course, there is often a certain level of abstraction to this binding of objects to storage 
locations. Consider the typical computer system 10 depicted in Fig. 1, for example. 
Data, and instructions for operating on them, that a microprocessor 1 1 uses may reside in 
on-board cache memory or be received from further cache memory 12, possibly through 
the mediation of a cache controller 13. That controller 13 can in turn receive such data 
from system read/write memory ("RAM") 14 through a RAM controller 15 or from vari- 
ous peripheral devices through a system bus 16. The memory space made available to an 
application program may be "virtual" m the sense that it may actually be considerably 
larger than RAM 14 provides. So the RAM contents will be swapped to and from a sys- 
tem disk 17. 

Additionally, the actual physical operations performed to access some of the 
most-recently visited parts of the process's address space often will actually be performed 
m the cache 12 or in a cache on board microprocessor 1 1 rather than on the RAM 14, 
with which those caches swap data and instructions just as RAM 14 and system disk 17 
do with each other. 

A fiirther level of abstraction results from the fact that an application will often be 
run as one of many processes operating concurrently with the support of an underlying 
operating system. As part of that system's memory management, the application's mem- 
ory space may be moved among different actual physical locations many times in order to 
allow different processes to employ shared physical memory devices. That is, the loca- 
tion specified in the application's machine code may actually resuU in different physical 
locations at different times because the operating system adds different offsets to the ma- 
chine-language-specified location. 
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Despite these expedients, the use of static memory allocation in writing certain 
long-lived applications makes it difficvdt to restrict storage requirements to the available 
memory space. Abiding by space limitations is easier when the platform provides for 
dynamic memory allocation, i.e., when memory space to be allocated to a given object is 
5 determined only at run time. 

Dynamic allocation has a number of advantages, among which is that the run-time 
system is able to adapt allocation to run-time conditions. For example, the programmer 
can specify tiiat space should be allocated for a given object only in response to a par- 
ticular run-time condition. The C-language library function mallocQ is often used for this 
10 purpose. Conversely, the programmer can specify conditions under which memory pre- 
viously allocated to a given object can be reclaimed for reuse. The C-language library 
fimction freeQ results in such memory reclamation. 

Because dynamic allocation provides for memory reuse, it facilitates generation 
of large or long-lived applications, which over the course of their lifetimes may employ 
15 objects whose total memory requirements would greatly exceed the available memory 
resources if they were bound to memory locations statically. 

Particularly for long-lived applications, though, allocation and reclamation of dy- 
namic memory must be performed carefully. If the application fails to reclaim imused 
memory — or, worse, loses track of the address of a dynamically allocated segment of 
20 memory — its memory requirements will grow over time to exceed the system's available 
memory. This kind of error is known as a "memory leak." 

Another kind of error occurs when an application reclaims memory for reuse even 
though it still maintains a reference to that memory. If the reclaimed memory is reallo- 
cated for a different purpose, the application may inadvertently manipulate the same 
25 memory in multiple inconsistent ways. This kind of error is known as a "dangling refer- 
ence," because an application should not retain a reference to a memory location once 
that location is reclaimed. Explicit dynamic-memory management by usmg interfaces 
like mallocQ/freeO often leads to these problems. 
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A way of reducing the likelihood of such leaks and related errors is to provide 
memory-space reclamation in a more-automatic manner. Techniques used by systems 
that reclaim memory space automatically are commonly referred to as "garbage collec- 
tion." Garbage collectors operate by reclaiming space that they no longer consider 

5 "reachable." Statically allocated objects represented by a program's global variables are 
normally considered reachable throughout a program's life. Such objects are not ordi- 
narily stored in the garbage collector's managed memory space, but they may contain 
references to dynamically allocated objects that are, and such objects are considered 
reachable. Clearly, an object referred to in the processor's call stack is reachable, as is an 

10 object referred to by register contents. And an object referred to by any reachable object 
is also reachable. 

The use of garbage collectors is advantageous because, whereas a progranmier 
working on a particular sequence of code can perform his task creditably in most respects 
with only local knowledge of the application at any given time, memory allocation and 

15 reclamation require a global knowledge of the program. Specifically, a programmer 
dealing with a given sequence of code does tend to know whether some portion of mem- 
ory is still in use for that sequence of code, but it is considerably more difficult for him to 
know what the rest of the application is doing with that memory. By tracing references 
from some conservative notion of a "root set," e.g., global variables, registers, and the 

20 call stack, automatic garbage collectors obtain global knowledge in a methodical way. 
By using a garbage collector, the programmer is relieved of the need to worry about the 
application's global state and can concentrate on local-state issues, which are more man- 
ageable. The result is applications that are more robust, having no dangling references 
and fewer memory leaks. 

25 Garbage-collection mechanisms can be implemented by various parts and levels 

of a computing system. One approach is simply to provide them as part of a batch com- 
piler's output. Consider Fig. 2's simple batch-compiler operation, for example. A com- 
puter system executes in accordance v^th compiler object code and therefore acts as a 
compiler 20. The compiler object code is typically stored on a medium such as Fig. 1 's 

30 system disk 17 or some other machine-readable medium, and it is loaded into RAM 14 to 
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configure the computer system to act as a compiler. In some cases, though, the compiler 
object code's persistent storage may instead be provided in a server system remote from 
the machine that performs the compiling. The electrical signals that carry the digital data 
by which the computer systems exchange that code are examples of the kinds of electro- 
5 magnetic signals by which the computer instructions can be communicated. Others are 
radio waves, microwaves, and both visible and invisible light. 

The input to the compiler is the application source code, and the end product of 
the compiler process is application object code. This object code defines an applica- 
tion 21, which typically operates on input such as mouse clicks, etc., to generate a display 
10 or some other type of output. This object code implements the relationship that the pro- 
grammer intends to specify by his application source code. In one approach to garbage 
collection, the compiler 20, without the programmer's explicit direction, additionally 
generates code that automatically reclaims unreachable memory space. 

Even in this simple case, though, there is a sense in which the application does not 
15 itself provide the entire garbage collector. Specifically, the application will typically call 
upon the imderlying operating system's memory-allocation fimctions. And the operating 
system may in tum take advantage of various hardware that lends itself particularly to use 
in garbage collection. So even a very simple system may disperse the garbage-collection 
mechanism over a number of computer-system layers. 

20 To get some sense of the variety of system components that can be used to im- 

plement garbage collection, consider Fig. 3's example of a more complex way in which 
various levels of source code can result in the machine instructions that a processor exe- 
cutes. In the Fig. 3 arrangement, the human applications programmer produces source 
code 22 written in a high-level language. A compiler 23 typically converts that code into 

25 "class files." These files include routines written in instructions, called "byte codes" 24, 
for a "virtual machine" that various processors can be software-configured to emulate. 
This conversion into byte codes is almost always separated in time from those codes' 
execution, so Fig. 3 divides the sequence into a "compile-time environment" 25 separate 
from a "run-time environment" 26, in which execution occurs. One example of a high- 

30 level language for which compilers are available to produce such virtual-machine in- 
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structions is the Java™ programming language. (Java is a trademark or registered 
trademark of Sun Microsystems, Inc., in the United States and other countries.) 

Most typically, the class files' byte-code routines are executed by a processor un- 
der control of a virtual-machine process 27. That process emulates a virtual machine 

5 from whose instruction set the byte codes are drawn. As is true of the compiler 23, the 
virtual-machine process 27 may be specified by code stored on a local disk or some other 
machine-readable medium from which it is read into Fig. Vs RAM 14 to configure the 
computer system to implement the garbage collector and otherwise act as a virtual ma- 
chine. Again, though, that code's persistent storage may instead be provided by a server 

10 . system remote from the processor that implements the virtual machine, in which case the 
code would be transmitted electrically or optically to the virtual-machine-implementing 
processor. 

In some implementations, much of the virtual machine's action in executing these 
byte codes is most like what those skilled in the art refer to as "interpreting," so Fig. 3 

15 depicts the virtual machine as including an "interpreter" 28 for that purpose. In addition 
to or instead of running an interpreter, many virtual-machine implementations actually 
compile the byte codes concurrently with the resultant object code's execution, so Fig. 3 
depicts the virtual machine as additionally including a "just-in-time" compiler 29. We 
will refer to the just-in-time compiler and the interpreter together as "execution engines" 

20 since they are the methods by which byte code can be executed. 

Now, some of the functionality that source-language constructs specify can be 
quite complicated, requiring many machine-language instructions for their implementa- 
tion. One quite-common example is a source-language instruction that calls for 64-bit 
arithmetic on a 32-bit machine. More germane to the present invention is the operation 
25 of dynamically allocating space to a new object; the allocation of such objects must be 
mediated by the garbage collector. 

In such situations, the compiler may produce "inline" code to accomplish these 
operations. That is, all object-code instructions for carrying out a given source-code- 
prescribed operation will be repeated each time the source code calls for the operation. 
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But inlining runs the risk that ''code bloat" will result if the operation is invoked at many 
source-code locations. 

The natural way of avoiding this result is instead to provide the operation's im- 
plementation as a procedure, i.e., a single code sequence that can be called from any lo- 

s cation in the program. In the case of compilers, a collection of procedures for imple- 
menting many types of source-code-specified operations is called a runtime system for 
the language. The execution engines and the runtime system of a virtual machine are de- 
signed together so that the engines "know" what runtime-system procedures are available 
in the virtual machine (and on the target system if that system provides facilities that are 

10 directly usable by an executing virtual-machine program.) So, for example, the just-in- 
time compiler 29 may generate native code that includes calls to memory-allocation pro- 
cedures provided by the virtual machine's runtime system. These allocation routines may 
in tum invoke garbage-collection routines of the runtime system when there is not 
enough memory available to satisfy an allocation. To represent this fact. Fig. 3 includes 

15 block 30 to show that the compiler's output makes calls to the runtime system as well as 
to the operating system 3 1 , which consists of procedures that are similarly system- 
resident but are not compiler-dependent. 

Although the Fig. 3 arrangement is a popular one, it is by no means universal, and 
many further implementation types can be expected. Proposals have even been made to 
20 implement the virtual machine 27's behavior in a hardware processor, in which case the 
hardware itself would provide some or all of the garbage-collection function. 

The arrangement of Fig. 3 differs from Fig. 2 in that the compiler 23 for convert- 
ing the human programmer's code does not contribute to providing the garbage- 
collection function; that results largely from the virtual machine 27 's operation. Those 
25 skilled in that art will recognize that both of these organizations are merely exemplary, 
and many modem systems employ hybrid mechanisms, which partake of the characteris- 
tics of traditional compilers and traditional interpreters both. 

The invention to be described below is applicable independently of whether a 
batch compiler, a just-in-time compiler, an interpreter, or some hybrid is employed to 
30 process source code. In the remainder of this application, therefore, we will use the term 
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compiler to refer to any such mechanism, even if it is what would more typically be 
called an interpreter. 

In short, garbage collectors can be implemented in a wide range of combinations 
of hardware and/or software. As is true of most of the garbage-collection techniques de- 
scribed in the literature, the invention to be described below is applicable to most such 
systems. 

By implementing garbage collection, a computer system can greatly reduce the 
occurrence of memory leaks and other software deficiencies in which human program- 
ming fi-equently results. But it can also have significant adverse performance effects if it 
is not implemented carefiiUy. To distinguish the part of the program that does '\isefiil" 
work from that which does the garbage collection, the term mutator is sometimes used in 
discussions of these effects; from the collector's point of view, what the mutator does is 
mutate active data structures' connectivity. 

Some garbage-collection approaches rely heavily on interleaving garbage- 
collection steps among mutator steps. In one type of garbage-collection approach, for 
instance, the mutator operation of writing a reference is followed immediately by gar- 
bage-collector steps used to maintain a reference count in that object's header, and code 
for subsequent new-object storage includes steps for finding space occupied by objects 
whose reference count has fallen to zero. Obviously, such an approach can slow mutator 
operation significantly. 

Other approaches therefore interleave very few garbage-collector-related instrac- 
tions into the main mutator process but instead interrupt it from time to time to perform 
garbage-collection cycles, in which the garbage collector finds unreachable objects and 
reclaims their memory space for reuse. Such an approach will be assumed in discussing 
Fig. 4's depiction of a simple garbage-collection operation. Within the memory space 
allocated to a given application is a part 40 managed by automatic garbage collection. In 
the following discussion, this will be referred to as the "heap," although in other contexts 
that term refers to all dynamically allocated memory. During the course of the applica- 
tion' s execution, space is allocated for various objects 42, 44, 46, 48, and 50. Typically, 
the mutator allocates space within the heap by invoking the garbage collector, which at 
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some level manages access to the heap. Basically, the mutator asks the garbage collector 
for a pointer to a heap region where it can safely place the object's data. The garbage 
collector keeps track of the fact that the thus-allocated region is occupied. It will refrain 
from allocating that region in response to any other request until it determines that the 
5 mutator no longer needs the region allocated to that object. 

Garbage collectors vary as to which objects they consider reachable and imreach- 
able. For the present discussion, though, an object will be considered "reachable" if it is 
referred to, as object 42 is, by a reference in the root set 52. The root set consists of ref- 
erence values stored in the mutator's threads' call stacks, the CPU registers, and global 
10 variables outside the garbage-collected heap. An object is also reachable if it is referred 
to, as object 46 is, by another reachable object (in this case, object 42). Objects that are 
not reachable can no longer affect the program, so it is safe to re-allocate the memory 
spaces that they occupy. 

A typical approach to garbage collection is therefore to identify all reachable ob- 
is jects and reclaim any previously allocated memory that the reachable objects do not oc- 
cupy. A typical garbage collector may identify reachable objects by tracing references 
from the root set 52. For the sake of simplicity. Fig. 4 depicts only one reference from 
the root set 52 into the heap 40. (Those skilled in the art will recognize that there are 
many ways to identify references, or at least data contents that may be references.) The 
20 collector notes that the root set points to object 42, which is therefore reachable, and that 
reachable object 42 points to object 46, which therefore is also reachable. But those 
reachable objects point to no other objects, so objects 44, 48, and 50 are all unreachable, 
and their memory space may be reclaimed. This may involve, say, placing that memory 
space in a list of free memory blocks. 

25 To avoid excessive heap fragmentation, some garbage collectors additionally re- 

locate reachable objects. Fig. 5 shows a typical approach. The heap is partitioned into 
two halves, hereafter called "semi-spaces.'' For one garbage-collection cycle, all objects 
are allocated in one semi-space 54, leaving the other semi-space 56 free. When the gar- 
bage-collection cycle occurs, objects identified as reachable are "evacuated" to the other 

30 semi-space 56, so all of semi-space 54 is then considered free. Once the garbage- 
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collection cycle has occurred, all new objects are allocated in the lower semi-space 56 
until yet another garbage-collection cycle occurs, at which time the reachable objects are 
evacuated back to the upper semi-space 54. 

Although this relocation requires the extra steps of copying the reachable objects 
and updating references to them, it tends to be quite efficient, since most new objects 
quickly become unreachable, so most of the current semi-space is actually garbage. That 
is, only a relatively few, reachable objects need to be relocated, after which the entire 
semi-space contains only garbage and can be pronoimced free for reallocation. 

Now, a collection cycle can involve following all reference chains from the basic 
root set — i.e., from inherently reachable locations such as the call stacks, class statics and 
other global variables, and registers — ^and reclaiming all space occupied by objects not 
encoxmtered in the process. And the simplest way of performing such a cycle is to inter- 
rupt the mutator to provide a collector interval in which the entire cycle is performed be- 
fore the mutator resumes. For certain types of applications, this approach to collection- 
cycle scheduling is acceptable and, in fact, highly efficient. 

For many interactive and real-time applications, though, this approach is not ac- 
ceptable. The delay in mutator operation that the collection cycle's execution causes can 
be annoying to a user and can prevent a real-time application from responding to its envi- 
ronment with the required speed. In some applications, choosing collection times op- 
portunistically can reduce this effect. Collection intervals can be inserted when an inter- 
active mutator reaches a point at which it awaits user input, for instance. 

So it may often be true that the garbage-collection operation's effect on perform- 
ance can depend less on the total collection time than on when collections actually occur. 
But another factor that often is even more determinative is the duration of any single 
collection interval, i.e., how long the mutator must remain quiescent at any one time. In 
an interactive system, for instance, a user may never notice hundred-millisecond inter- 
ruptions for garbage collection, whereas most users would find interruptions lasting for 
two seconds to be annoying. 
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The cycle may therefore be divided up among a plurality of collector intervals. 
When a collection cycle is divided up among a plurality of collection intervals, it is only 
after a number of intervals that the collector will have followed all reference chains and 
be able to identify as garbage any objects not thereby reached. This approach is more 
complex than completing the cycle in a single collection interval; the mutator will usually 
modify references between collection intervals, so the collector must repeatedly update 
its view of the reference graph in the midst of the collection cycle. To make such updates 
practical, the mutator must communicate with the collector to let it know what reference 
changes are made between intervals. 

An even more complex approach, which some systems use to eliminate discrete 
pauses or maximize resource-use efficiency, is to execute the mutator and collector in 
concurrent execution threads. Most systems that use this approach use it for most but not 
all of the collection cycle; the mutator is usually interrupted for a short collector interval, 
in which a part of the collector cycle takes place without mutation. 

Independent of whether the collection cycle is performed concurrently with mu- 
tator operation, is completed in a single interval, or extends over multiple intervals is the 
question of whether the cycle is complete, as has tacitly been assumed so far, or is instead 
"incremental.'' In incremental collection, a collection cycle constitutes only an increment 
of collection: the collector does not follow all reference chains from the basic root set 
completely. Instead, it concentrates on only a portion, or collection set, of the heap. 
Specifically, it identifies every collection-set object referred to by a reference chain that 
extends into the collection set from outside of it, and it reclaims the collection-set space 
not occupied by such objects, possibly after evacuating them from the collection set. 

By thus culling objects referenced by reference chains that do not necessarily 
originate in the basic root set, the collector can be thought of as expanding the root set to 
include as roots some locations that may not be reachable. Although incremental collec- 
tion thereby leaves "floating garbage," it can result in relatively low pause times even if 
entire collection increments are completed during respective single collection intervals. 

Most collectors that employ incremental collection operate in "generations," al- 
though this is not necessary in principle. Different portions, or generations, of the heap 
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are subject to different collection policies. New objects are allocated in a "young" gen- 
eration, and older objects are promoted from younger generations to older or more "ma- 
ture" generations. Collecting the yoimger generations more frequently than the others 
yields greater efficiency because the younger generations tend to accumulate garbage 
faster; newly allocated objects tend to "die," while older objects tend to "survive." 

But generational collection greatly increases what is effectively the root set for a 
given generation. Consider Fig. 6, which depicts a heap as organized into three genera- 
tions 58, 60, and 62. Assume that generation 60 is to be collected. The process for this 
individual generation may be more or less the same as that described in connection with 
Figs. 4 and 5 for the entire heap, with one major exception. In the case of a single gen- 
eration, the root set must be considered to include not only the call stack, registers, and 
global variables represented by set 52 but also objects in the other generations 58 and 62, 
which themselves may contain references to objects in generation 60. So pointers must 
be traced not only from the basic root set 52 but also from objects within the other gen- 
erations. 

One could perform this tracing by simply inspecting all references in all other 
generations at the beginning of every collection interval, and it turns out that this ap- 
proach is actually feasible in some situations. But it takes too long in other situations, so 
workers in this field have employed a number of approaches to expediting reference 
tracing. One approach is to include so-called write barriers in the mutator process. A 
write barrier is code added to a vmte operation to record information from which the 
collector can determine where references were written or may have been since the last 
collection interval. A reference list can then be maintained by taking such a list as it ex- 
isted at the end of the previous collection interval and updating it by inspecting only lo- 
cations identified by the write barrier as possibly modified since the last collection inter- 
val. 

One of the many write-barrier implementations commonly used by workers in this 
art employs what has been referred to as the "card table." Fig. 6 depicts the various gen- 
erations as being divided into smaller sections, known for this purpose as "cards." Card 
tables 64, 66, and 68 associated vsdth respective generations contain an entry for each of 
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their cards. When the mutator writes a reference in a card, it makes an appropriate entry 
in the card-table location associated with that card (or, say, with the card in which the 
object containing the reference begins). Most write-barrier implementations simply make 
a Boolean entry indicating that the write operation has been performed, although some 
may be more elaborate. The mutator having thus left a record of where new or modified 
references may be, the collector can thereafter prepare appropriate summaries of that in- 
formation, as will be explained in due course. For the sake of concreteness, we will as- 
sume that the summaries are maintained by steps that occur principally at the beginning 
of each collection interval. 

Of course, there are other write-barrier approaches, such as simply having the 
write barrier add to a list of addresses where references where written. Also, although 
there is no reason in principle to favor any particular number of generations, and although 
Fig. 6 shows three, most generational garbage collectors have only two generations, of 
which one is the young generation and the other is the mature generation. Moreover, al- 
though Fig. 6 shows the generations as being of the same size, a more-typical configura- 
tion is for the young generation to be considerably smaller. Finally, although we as- 
sumed for the sake of simplicity that collection during a given interval was limited to 
only one generation, a more-typical approach is actually to collect the whole young gen- 
eration at every interval but to collect the mature one less frequently. 

Some collectors collect the entire young generation in every interval and may 
thereafter perform mature-generation collection in the same interval. It may therefore 
take relatively little time to scan all young-generation objects remaining after young- 
generation collection to find references into the mature generation. Even when such col- 
lectors do use card tables, therefore, they often do not use them for finding young- 
generation references that refer to mature-generation objects. On the other hand, labori- 
ously scanning the entire mature generation for references to young-generation (or ma- 
ture-generation) objects would ordinarily take too long, so the collector uses the card ta- 
ble to limit the amount of memory it searches for mature-generation references. 

Now, although it typically takes very little time to collect the young generation, it 
may take more time than is acceptable within a single garbage-collection cycle to collect 
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the entire mature generation. So some garbage collectors may collect the mature genera- 
tion incrementally; that is, they may perform only a part of the mature generation's col- 
lection during any particular collection cycle. Incremental collection presents the prob- 
lem that, since the generation's unreachable objects outside the "collection set" of objects 
processed during that cycle cannot be recognized as unreachable, collection-set objects to 
which they refer tend not to be, either. 

To reduce the adverse effect this would otherwise have on collection efficiency, 
workers in this field have employed the '"train algorithm," which Fig. 7 depicts. A gen- 
eration to be collected incrementally is divided into sections, which for reasons about to 
be described are referred to as "car sections." Conventionally, a generation's incremental 
collection occurs in fixed-size sections, and a car section's size is that of the generation 
portion to be collected during one cycle. 

The discussion that follows will occasionally employ the nomenclature in the lit- 
erature by using the term car instead of car section. But the literature seems to use that 
term to refer variously not only to memory sections themselves but also to data structures 
that the train algorithm employs to manage them when they contain objects, as well as to 
the more-abstract concept that the car section and managing data structure represent in 
discussions of the algorithm. So the following discussion will more frequently use the 
expression car section to emphasize the actual sections of memory space for whose man- 
agement the car concept is employed. 

According to the train algorithm, the car sections are grouped into "trains," which 
are ordered, conventionally according to age. For example. Fig. 7 shows an oldest 
train 73 consisting of a generation 74's three car sections described by associated data 
structures 75, 76, and 78, while a second train 80 consists only of a single car section, 
represented by structure 82, and the youngest train 84 (referred to as the "allocation 
train") consists of car sections that data structures 86 and 88 represent. As will be seen 
below, car sections' train memberships can change, and any car section added to a train is 
typically added to the end of a train. 

Conventionally, the car collected in an increment is the one added earliest to the 
oldest train, which in this case is car 75. All of the generation's cars can thus be thought 

14 



PATENT 
112047-0070 

of as waiting for collection in a single long line, in which cars are ordered in accordance 
with the order of the trains to which they belong and, within trains, in accordance with 
the order in which they were added to those trains. 

As is usual, the way in which reachable objects are identified is to determine 
whether there are references to them in the root set or in any other object already deter- 
mined to be reachable. In accordance with the train algorithm, the collector additionally 
performs a test to determme whether there are any references at all from outside the old- 
est train to objects withm it. If there are not, then all cars within the train can be re- 
claimed, even though not all of those cars are in the collection set. And the train algo- 
rithm so operates that inter-car references tend to be grouped into trains, as will now be 
explained. 

To identify references into the car from outside of it, train-algorithm implementa- 
tions typically employ "remembered sets." As card tables are, remembered sets are used 
to keep track of references. Whereas a card-table entry contains information about refer- 
ences that the associated card contains, though, a remembered set associated with a given 
region contains information about references mto that region from locations outside of it. 
In the case of the train algorithm, remembered sets are associated with car sections. Each 
remembered set, such as car 75's remembered set 90, lists locations in the generation that 
contain references into the associated car section. 

The remembered sets for all of a generation's cars are typically updated at the 
start of each collection cycle. To illustrate how such updating and other collection op- 
erations may be carried out. Figs. 8A and 8B (together, "Fig. 8") depict an operational 
sequence in a system of the typical type mention above. That is, it shows a sequence of 
operations that may occur in a system in which the entire garbage-collected heap is di- 
vided into two generations, namely, a young generation and an old generation, and in 
which the young generation is much smaller than the old generation. Fig. 8 is also based 
on the assumption and that the train algorithm is used only for collecting the old genera- 
tion. 

Block 102 represents a period of the mutator's operation. As was explained 
above, the mutator makes a card-table entry to identify any card that it has "dirtied" by 
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adding or modifying a reference that the card contains. At some point, the mutator will 
be interrupted for collector operation. Different implementations employ different events 
to trigger such an interruption, but we will assume for the sake of concreteness that the 
system's dynamic-allocation routine causes such interruptions when no room is left in the 
young generation for any further allocation. A dashed line 103 represents the transition 
from mutator operation and collector operation. 

In the system assumed for the Fig. 8 example, the collector collects the (entire) 
young generation each time such an interruption occurs. When the young generation's 
collection ends, the mutator operation usually resumes, without the collector's having 
collected any part of the old generation. Once in a while, though, the collector also col- 
lects part of the old generation, and Fig. 8 is intended to illustrate such an occasion. 

When the collector's interval first starts, it first processes the card table, in an op- 
eration that block 104 represents. As was mentioned above, the collector scans the "dirt- 
ied" cards for references into the young generation. If a reference is found, that fact is 
memorialized appropriately. If the reference refers to a young-generation object, for ex- 
ample, an expanded card table may be used for this purpose. For each card, such an ex- 
panded card table might include a multi-byte array used to sununarize the card's refer- 
ence contents. The summary may, for instance, be a list of offsets that indicate the exact 
locations within the card of references to young-generation objects, or it may be a list of 
fine- granularity "sub-cards" within which references to young-generation objects may be 
foimd. If the reference refers to an old-generation object, the collector often adds an en- 
try to the remembered set associated with the car containing that old-generation object. 
The entry identifies the reference's location, or at least a small region in which the refer- 
ence can be found. For reasons that will become apparent, though, the collector will 
typically not bother to place in the remembered set the locations of references fi-om ob- 
jects in car sections farther forward in the collection queue than the referred-to object, 
i.e., from objects in older trains or in cars added earlier to the same train. 

The collector then collects the young generation, as block 105 indicates. (Actu- 
ally, young-generation collection may be interleaved with the dirty-region scanning, but 
the drawing illustrates it for purpose of explanation as being separate.) If a young- 
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generation object is referred to by a reference that card-table scanning has revealed, that 
object is considered to be potentially reachable, as is any young-generation object re- 
ferred to by a reference in the root set or in another reachable young-generation object. 
The space occupied by any young-generation object thus considered reachable is with- 
held from reclamation. For example, it may be evacuated to a young-generation semi- 
space that will be used for allocation during the next mutator interval. It may instead be 
promoted into the older generation, where it is placed into a car containing a reference to 
it or into a car in the last train. Or some other technique may be used to keep the memory 
space it occupies off the system's free list. The collector then reclaims any young- 
generation space occupied by any other objects, i.e., by any young-generation objects not 
identified as transitively reachable through references located outside the young genera- 
tion. 

The collector then performs the train algorithm's central test, referred to above, of 
determining whether there are any references into the oldest train from outside of it. As 
was mentioned above, the actual process of determining, for each object, whether it can 
be identified as unreachable is performed for only a single car section in any cycle. In the 
absence of features such as those provided by the train algorithm, this would present a 
problem, because garbage structures may be larger than a car section. Objects in such 
structures wovdd therefore (erroneously) appear reachable, since they are referred to from 
outside the car section under consideration. But the train algorithm additionally keeps 
track of whether there are any references into a given car from outside the train to which 
it belongs, and trains' sizes are not limited. As will be apparent presently, objects not 
found to be unreachable are relocated in such a way that garbage structures tend to be 
gathered into respective trains into which, eventually, no references from outside the train 
point. If no references from outside the train point to any objects inside the train, the 
train can be recognized as containing only garbage. This is the test that block 106 repre- 
sents. All cars in a train thus identified as containing only garbage can be reclaimed. 

The question of whether old-generation references point into the train from out- 
side of it is (conservatively) answered in the course of updating remembered sets; in the 
course of updating a car's remembered set, it is a simple matter to flag the car as being 
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referred to from outside the train. The step- 106 test additionally involves determining 
whether any references from outside the old generation point into the oldest train. Vari- 
ous approaches to making this determination have been suggested, including the concep- 
tually simple approach of merely following all reference chains from the root set until 
those chains (1) terminate, (2) reach an old-generation object outside the oldest train, or 
(3) reach an object in the oldest train. In the two-generation example, most of this work 
can be done readily by identifying references into the collection set from live young- 
generation objects during the young-generation collection. If one or more such chains 
reach the oldest train, that train includes reachable objects. It may also include reachable 
objects if the remembered-set-update operation has found one or more references into the 
oldest train from outside of it. Otherwise, that train contains only garbage, and the col- 
lector reclaims all of its car sections for reuse, as block 107 indicates. The collector may 
then return control to the mutator, which resumes execution, as Fig. SB's block 108 indi- 
cates. 

If the train contains reachable objects, on the other hand, the collector tums to 
evacuating potentially reachable objects from the collection set. The first operation, 
which block 110 represents, is to remove from the collection set any object that is reach- 
able from the root set by way of a reference chain that does not pass through the part of 
the old generation that is outside of the collection set. In the illustrated arrangement, in 
which there are only two generations, and the yoimg generation has previously been 
completely collected during the same interval, this means evacuating from a collection 
set any object that (1) is directly referred to by a reference in the root set, (2) is directly 
referred to by a reference in the young generation (in which no remaining objects have 
been found unreachable), or (3) is referred to by any reference in an object thereby 
evacuated. All of the objects thus evacuated are placed in cars in the youngest train, 
which was newly created during the collection cycle. Certain of the mechanics involved 
in the evacuation process are described in more detail in coimection with similar evacua- 
tion performed, as blocks 1 12 and 114 indicate, m response to remembered-set entries. 

Fig. 9 illustrates how the processing represented by block 114 proceeds. The en- 
tries identify heap regions, and, as block 116 indicates, the collector scans the thus- 
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identified heap regions to find references to locations in the collection-set. As blocks 118 
and 120 indicate, that entry's processing continues until the collector finds no more such 
references. Every time the collector does fmd such a reference, it checks to determine 
whether, as a result of a previoxis entry's processing, the referred-to object has already 
been evacuated. If it has not, the collector evacuates the referred-to object to a (possibly 
new) car in the train containing the reference, as blocks 122 and 124 indicate. 

As Fig. 10 indicates, the evacuation operation includes more than just object relo- 
cation, which block 126 represents. Once the object has been moved, the collector places 
a forwarding pointer in the collection-set location from which it was evacuated, for a 
purpose that will become apparent presently. Block 128 represents that step. (Actually, 
there are some cases in which the evacuation is only a "logical" evacuation: the car con- 
taining the object is simply re-linked to a different logical place in the collection se- 
quence, but its address does not change. In such cases, forwarding pointers are uimeces- 
sary.) Additionally, the reference in response to which the object was evacuated is up- 
dated to point to the evacuated object's new location, as block 130 indicates. And, as 
block 132 indicates, any reference contained in the evacuated object is processed, in an 
operation that Figs. 1 1 A and 1 IB (together, "Fig. 1 1") depict. 

For each one of the evacuated object's references, the collector checks to see 
whether the location that it refers to is in the collection set. As blocks 134 and 136 indi- 
cate, the reference processing continues until all references in the evacuated object have 
been processed. In the meantime, if a reference refers to a collection-set location that 
contains an object not yet evacuated, the collector evacuates the referred-to object to the 
train to which the evacuated object containing the reference was evacuated, as blocks 138 
and 140 indicate. 

If the reference refers to a location in the collection set from which the object has 
already been evacuated, then the collector uses the forwarding pointer left in that location 
to update the reference, as block 142 indicates. Before the processing of Fig. 1 1, the re- 
membered set of the referred-to object's car will have an entry that identifies the evacu- 
ated object's old location as one containing a reference to the referred-to object. But the 
evacuation has placed the reference in a new location, for which the remembered set of 
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the referred-to object's car may not have an entry. So, if that new location is not as far 
forward as the referred-to object, the collector adds to that remembered set an entry iden- 
tifying the reference's new region, as blocks 144 and 146 indicate. As the drawings 
show, the same type of remembered-set update is performed if the object referred to by 
the evacuated reference is not in the collection set. 

Now, some train-algorithm implementations postpone processing of the refer- 
ences contained in evacuated collection-set objects until after all directly reachable col- 
lection-set objects have been evacuated. In the implementation that Fig. 10 illustrates, 
though, the processing of a given evacuated object's references occurs before the next 
object is evacuated. So Fig. 1 1 's blocks 134 and 148 indicate that the Fig. 1 1 operation is 
completed when all of the references contained in the evacuated object have been proc- 
essed. This completes Fig. lO's object-evacuation operation, which Fig. 9's block 124 
represents. 

As Fig. 9 indicates, each collection-set object referred to by a reference in a re- 
membered-set-entry-identified location is thus evacuated if it has not been already. If the 
object has already been evacuated from the referred-to location, the reference to that lo- 
cation is updated to point to the location to which the object has been evacuated. If the 
remembered set associated with the car containing the evacuated object's new location 
does not include an entry for the reference's location, it is updated to do so if the car 
containing the reference is younger than the car containing the evacuated object. 
Block 150 represents updating the reference and, if necessary, the remembered set. 

As Fig. 8's blocks 1 12 and 1 14 indicate, this processing of collection-set remem- 
bered sets is performed initially only for entries that do not refer to locations in the oldest 
train. Those that do are processed only after all others have been, as blocks 152 and 154 
indicate. 

When this process has been completed, the collection set's memory space can be 
reclaimed, as block 164 indicates, since no remaining object is referred to from outside 
the collection set: any remaining collection-set object is unreachable. The collector then 
relinquishes control to the mutator. 
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Figs. 12A-12J illustrate resxilts of using the train algorithm. Fig. 12A represents a 
generation in which objects have been allocated in nine car sections. The oldest train has 
four cars, numbered 1.1 through 1 .4. Car 1.1 has two objects, A and B. There is a refer- 
ence to object B in the root set (which, as was explained above, includes live objects in 
the other generations). Object A is referred to by object L, which is in the third train's 
sole car section. In the generation's remembered sets 170, a reference in object L has 
therefore been recorded against car 1 . 1 . 

Processing always starts with the oldest train's earliest-added car, so the garbage 
collector refers to car 1 . 1's remembered set and finds that there is a reference from ob- 
ject L into the car being processed. It accordingly evacuates object A to the train that 
object L occupies. The object being evacuated is often placed in one of the selected 
train's existing cars, but we will assume for present purposes that there is not enough 
room. So the garbage collector evacuates object A into a new car section and updates 
appropriate data structures to identify it as the next car in the third train. Fig. 12B depicts 
the result: a new car has been added to the third train, and object A is placed in it. 

Fig. 12B also shows that object B has been evacuated to a new car outside the 
first train. This is because object B has an extemal reference, which, like the reference to 
object A, is a reference from outside the first train, and one goal of the processing is to 
form trains into which there are no further references. Note that, to maintain a reference 
to the same object, object L's reference to object A has had to be rewritten, and so have 
object B's reference to object A and the inter-generational pointer to object B. In the il- 
lustrated example, the garbage collector begins a new train for the car into which object B 
is evacuated, but this is not a necessary requirement of the train algorithm. That algo- 
rithm requires only that extemally referenced objects be evacuated to a newer train. 

Since car 1.1 no longer contains live objects, it can be reclaimed, as Fig. 12B also 
indicates. Also note that the remembered set for car 2. 1 now includes the address of a 
reference in object A, whereas it did not before. As was stated before, remembered sets 
in the illustrated embodiment include only references from cars further back in the order 
than the one with which the remembered set is associated. The reason for this is that any 
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other cars will already be reclaimed by the time the car associated with that remembered 
set is processed, so there is no reason to keep track of references from them. 

The next step is to process the next car, the one whose index is 1.2. Convention- 
ally, this would not occur until some collection cycle after the one during which car 1 .1 is 
collected. For the sake of simplicity we will assimie that the mutator has not changed any 
references into the generation in the interim. 

Fig. 12B depicts car 1 .2 as containing only a single object, object C, and that car's 
remembered set contains the address of an inter-car reference from object F. The garbage 
collector follows that reference to object C. Since this identifies object C as possibly 
reachable, the garbage collector evacuates it from car set 1.2, which is to be reclaimed. 
Specifically, the garbage collector removes object C to a new car section, section 1.5, 
which is linked to the train to which the referring object F's car belongs. Of course, ob- 
ject F's reference needs to be updated to object C's new location. Fig. 12C depicts the 
evacuation's result. 

Fig. 12C also indicates that car set 1 .2 has been reclaimed, and car 1 .3 is next to 
be processed. The only address in car 1 .3's remembered set is that of a reference in ob- 
ject G. Inspection of that reference reveals that it refers to object F. Object F may there- 
fore be reachable, so it must be evacuated before car section 1.3 is reclaimed. On the 
other hand, there are no references to objects D and E, so they are clearly garbage. 
Fig. 12D depicts the result of reclaiming car 1 .3's space after evacuating possibly reach- 
able object F. 

In the state that Fig. 12D depicts, car 1 .4 is next to be processed, and its remem- 
bered set contains the addresses of references in objects K and C. Inspection of ob- 
ject K's reference reveals that it refers to object H, so object H must be evacuated. In- 
spection of the other remembered-set entry, the reference in object C, reveals that it refers 
to object G, so that object is evacuated, too. As Fig. 12E illustrates, object H must be 
added to the second train, to which its referring object K belongs. In this case there is 
room enough in car 2.2, which its referring object K occupies, so evacuation of object H 
does not require that object K's reference to object H be added to car 2.2's remembered 
set. Object G is evacuated to a new car in the same train, since that train is where refer- 
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ring object C resides. And the address of the reference in object G to object C is added to 
car 1.5's remembered set. 

Fig. 12E shows that this processing has eliminated all references into the first 
train, and it is an important part of the train algorithm to test for this condition. That is, 
even though there are references into both of the train's cars, those cars' contents can be 
recognized as all garbage because there are no references into the train from outside of it. 
So all of the first train's cars are reclaimed. 

The collector accordingly processes car 2.1 during the next collection cycle, and 
that car's remembered set indicates that there are two references outside the car that refer 
to objects within it. Those references are in object K, which is in the same train, and ob- 
ject A, which is not. hispection of those references reveals that they refer to objects I and 
I, which are evacuated. 

The result, depicted in Fig. 12F, is that the remembered sets for the cars in the 
second train reveal no inter-car references, and there are no inter-generational references 
into it, either. That train's car sections therefore contain only garbage, and their memory 
space can be reclaimed. 

So car 3.1 is processed next. Its sole object, object L, is referred to inter- 
generationally as well as by a reference in the fourth train's object M. As Fig. 12G 
shows, object L is therefore evacuated to the fourth train. And the address of the refer- 
ence in object L to object A is placed in the remembered set associated with car 3.2, in. 
which object A resides. 

The next car to be processed is car 3.2, whose remembered set includes the ad- 
dresses of references into it from objects B and L. Inspection of the reference from ob- 
ject B reveals that it refers to object A, which must therefore be evacuated to the fifth 
train before car 3.2 can be reclaimed. Also, we assume that object A cannot fit in car 
section 5.1, so a new car 5.2 is added to that train, as Fig. 12H shows, and object A is 
placed in its car section. All referred-to objects in the third train having been evacuated, 
that (single-car) train can be reclaimed in its entirety. 



23 



PATENT 
112047-0070 

A further observation needs to be made before we leave Fig. 12G. Car 3.2's re- 
membered set additionally lists a reference in object L, so the garbage collector inspects 
that reference and finds that it points to the location previously occupied by object A. 
This brings up a feature of copying-collection techniques such as the typical train- 
algorithm implementation. When the garbage collector evacuates an object fi:om a car 
section, it marks the location as having been evacuated and leaves the address of the ob- 
ject's mv/ location. So, vs^hen the garbage collector traces the reference from object L, it 
finds that object A has been removed, and it accordingly copies the new location into 
object L as the new value of its reference to object A. 

In the state that Fig. 12H illustrates, car 4.1 is the next to be processed. Inspection 
of the fourth train's remembered sets reveals no inter-train references into it, but the in- 
ter-generational scan (possibly performed with the aid of Fig. 6's card tables) reveals in- 
ter-generational references into car 4.2. So the fourth train cannot be reclaimed yet. The 
garbage collector accordingly evacuates car 4.1' s referred-to objects in the normal man- 
ner, vnih the result that Fig. 121 depicts. 

In that state, the next car to be processed has only inter-generational references 
into it. So, although its referred-to objects must therefore be evacuated from the train, 
they cannot be placed into trains that contain references to them. Conventionally, such 
objects are evacuated to a train at the end of the train sequence. In the illustrated imple- 
mentation, a new train is formed for this purpose, so the result of car 4.2's processing is 
the state that Fig. 12J depicts. 

Processing continues in this same fashion. Of course, subsequent collection cy- 
cles will not in general proceed, as in the illustrated cycles, without any reference 
changes by the mutator and without any addition of fiuther objects. But reflection re- 
veals that the general approach just described still applies when such mutations occur. 

A collector based on the train algorithm should collect the oldest train in a finite 
number of collections. The collection may be slow but it should persist. However, in 
one troublesome instance, the collector is imable to progress beyond the oldest train. 
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The problem is illustrated by the example where two objects in the oldest train, 
but not in the same collection set, reference each other, and where there is an extemal 
reference to at least one of the objects. A malicious application might change the object 
to which the extemal reference points resulting in a futile operation where the collector 
makes no progress. Fig. 16 illustrates this example. Suppose that car 1 .1 is the collec- 
tion set, objects A and B reference each other and they cannot be placed in the same car. 
When object A is considered for collection and the reference from B is found, object A is 
removed to a new car 1 .3 at the end of the same train, as in the standard Train algorithm 
operation. When car 1.2 is collected, if the root R is changed by an application from ref- 
erencing object B to referencing object A it is evident that the operation has made no 
progress. This futile collection cycle may continue preventing the collector from pro- 
gressing beyond the oldest train. 

Grarup and Seligman, ''Incremental Mature Garbage Collector, " M.Sc. Thesis, 
(available at http://www.daimi.au.dk/- jacobse/papers) approach this problem by remem- 
bering a previous root to another object in the oldest train. When that object becomes 
part of the collection set that object is evacuated to a younger train. The evacuation of 
this object will reduce the size of the train in the normal fashion thereby breaking the fu- 
tile situation. Moreover, the technique is implemented only after failure to make progress 
has been detected. However, one drawback is that this technique requires the overhead of 
evacuating objects that may be actually unreachable. 

There is a need to break the futile collection cycle in an efficient manner without 
copying unreachable dead objects. 

SUMMARY OF THE INVENTION 

The technique for overcoming the effects of otherwise futile cycles in a collector 
based on the Train algorithm is to augment the collection set with selected cars that 
eventually guarantee that progress is made. 

So the approach, after determining that a futile collections cycle has been entered, 
is to identify and include in the collection set one or more younger cars in the oldest train 
that contain objects referenced from outside the oldest train. These cars when added to 
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the collection-set and collected will reduce the volume of the oldest train and break the 
futile cycle. The added cars* remembered sets are scanned together with external sources 
of roots into the collection set and reachable objects are evacuated to appropriate younger 
trains. The only additional scanning that must be done is of cars in the oldest train that 
are not in the collection set but are older than any of the added cars. This is done since 
references from objects in these intervening cars will not necessarily be recorded in the 
added car' remembered sets. If any objects are evacuated to younger trains or if any ob- 
jects are foimd to have become unreachable reducing the size of the oldest train, then the 
futile-collection condition has been broken. 

Selection of cars for addition to the collection-set may be based on information 
gathered from previous collection increments. For example, the remembered sets for a 
car typically records information summarizing whether any objects in yoxmger trains re- 
fer to objects in that car. Similarly, car structures typically have a field indicating 
whether any external roots refer to objects in the car. This approach allows for the use of 
simple criteria to select cars but suffers from the fact that the information may be out-of- 
date. In the intervening period since the previous interval, the application may have 
modified references to objects. For this reason, this technique often succeeds but is not 
guaranteed to do so. As such, it may be attempted one or more times before attempting 
to guarantee progress and break the futile-collection condition. 

Selection of cars for addition to the collection set may also be based on informa- 
tion known to be accurate. For example, cars that have had references from yoimger 
trains recorded in this collection increment will be known to have references from outside 
the oldest train. Similarly, selection of cars guaranteed to break the futile-collection cy- 
cle may be done by collecting the added cars after the initial collection-set has been col- 
lected. This technique allows us to have accurate information about extemal roots into 
cars in the oldest train. As a last resort, the remembered sets of cars remaining in the old- 
est train indicating that they contain recorded references from younger trains may be 
scanned. If no such references are found and if no extemal roots refer to cars remaining 
in the oldest train, then the oldest train's car are imreachable and may be reclaimed as a 
group. 
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The approach may be further refined so that if a single object is observed in a car 
outside the collection set to be reachable from outside the oldest train, then rather than 
augmenting the collection set wdth an entire car, that single object may be evacuated. 
This observation may be based on the processing of the external roots into the collection 
set or it may be based on the processing of references recorded in the remembered sets of 
cars already in the collection set. Processing roots, references recorded in the object's 
car's remembered set, and references in older, non-collection set cars proceed normally 
except that only that object is evacuated. Finally, the car's remaining objects are scanned 
to update any references to the relocated object. 

These techniques for augmenting the collection set in order to break fiitile- 
coUection cycles represent an advance in that they both break such cycles and that they 
only evacuate objects currently known to be reachable. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention description below refers to the accompanying drawings, of which: 
Fig. 1, discussed above, is a block diagram of a computer system in which the 

present invention's teachings can be practiced; 

Fig. 2 is, discussed above, is a block diagram that illustrates a compiler's basic 

functions; 

Fig. 3, discussed above, is a block diagram that illustrates a more-complicated 
compiler/interpreter organization; 

Fig. 4, discussed above, is a diagram that illustrates a basic garbage-collection 
mechanism; 

Fig. 5, discussed above, is a similar diagram illustrating that garbage-collection 
approach's relocation operation; 

Fig. 6, discussed above, is a diagram that illustrates a garbage-collected heap's 
organization into generations; 

Fig. 7, discussed above, is a diagram that illustrates a generation organization em- 
ployed for the train algorithm; 
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Figs. 8A and 8B, discussed above, together constitute a flow chart that illustrates 
a garbage-collection interval that includes old-generation collection; 

Fig. 9, discussed above, is a flow chart that illustrates in more detail the remem- 
bered-set processing included in Fig. 8 A; 

Fig. 10, discussed above, is a block diagram that illustrates in more detail the re- 
ferred-to-object evacuation that Fig. 9 includes; 

Fig. 1 1, discussed above, is a flow chart that illustrates in more detail the Fig. 10 
flow chart's step of processing evacuated objects' references; 

Figs. 12A-12J, discussed above, are diagrams that illustrate a collection scenario 
that can result from using the train algorithm; 

Figs. 13A - 13B together constitute a flow chart that illustrates a collection inter- 
val, as Figs. 8A and 8B do, but illustrate optimization that Figs. 8A and 8B do not in- 
clude; 

Fig. 14 is a diagram that illustrates example data structures that can be employed 
to manage cars and trains in accordance with the train algorithm; 

Fig. 15 is a diagram that illustrates data structures employed in managing differ- 
ent-sized car sections; 

Fig. 16 is a block diagram illustrating the futile situation; 

Figs. 17 is a flow chart showing preparation for collecting. 

Figs. 18 and 19 are flow charts of a preferred process for breaking a futile situa- 
tion; and 

Figs. 20A and 20B are diagrams showing the addition of an object to a collection 
set to break a futile cycle. 

DETAILED DESCMPTION OF AN ILLUSTRATIVE 

EMBODIMENT 

The illustrated embodiment employs a way of implementing the train algorithm 
that is in general terms similar to the way described above. But, whereas it was tacitly 
assumed above that, as is conventional, only a single car section would be collected in 
any given collection interval, the embodiment now to be discussed may collect more than 
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a single car during a collection interval. Figs. 13A and 13B (together, "Fig. 13") there- 
fore depict a collection operation that is similar to the one that Fig. 8 depicts, but Fig. 13 
reflects the possibility of multiple-car collection sets and depicts certain optimizations 
that some of the invention's embodiments may employ. 

Blocks 172, 176, and 178 represent operations that correspond to those that 
Fig. 8's blocks 102, 106, and 108 do, and dashed line 174 represents the passage of con- 
trol from the mutator to the collector, as Fig. 8's dashed line 104 does. 

When the collector process begins, the collector prepares for the collection 175 by 
a process shown in and discussed with reference to Fig. 18. With reference to Fig. 13 A, 
when the yovmg generation collection 178 is started, and, if a futile cycle is detected 179, 
(see later discussion associated with Fig. 18) an optimistic strategy to break a futile cycle 
is tried. 

For the sake of efficiency, though, the collection operation of Fig. 13 includes a 
step represented by block 180. In this step, the collector reads the remembered set of 
each car in the collection set to determine the location of each reference into the collec- 
tion set from a car outside of it, it places the address of each reference thereby found into 
a scratch-pad list associated with the train that contains that reference, and it places the 
scratch-pad lists in reversed -train order. 

Before the collector processes references in that train's scratch-pad list, the col- 
lector evacuates any objects referred to from outside the old generation, as block 186 in- 
dicates. To identify such objects, the collector scans the root set. In some generational 
collectors, it may also have to scan other generations for references into the collection set. 
For the sake of example, though, we have assumed the particularly common scheme in 
which a generation's collection in a given interval is always preceded by complete col- 
lection of every (in this case, only one) younger generation in the same interval. If, in 
addition, the collector's promotion policy is to promote all surviving younger-generation 
objects into older generations, it is necessary only to scan older generations, of which 
there are none in the example; i.e., some embodiments may not require that the young 
generation be scanned in the block- 186 operation. 
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For those that do, though, the scanning may actually involve inspecting each sur- 
viving object in the young generation, or the collector may expedite the process by using 
card-table entries. Regardless of v^hich approach it uses, the collector immediately 
evacuates into another train any collection-set object to which it thereby finds an external 
reference. The typical policy is to place the evacuated object into the youngest such 
train. As before, the collector does not attempt to evacuate an object that has already 
been evacuated, and, when it does evacuate an object to a train, it evacuates to the same 
train each collection-set object to which a reference the thus-evacuated object refers. In 
any case, the collector updates the reference to the evacuated object. 

When the inter-generational references into the generation have thus been proc- 
essed, the garbage collector determines whether there are any references into the oldest 
train from outside that train. If not, the entire train can be reclaimed, as blocks 188 
and 190 indicate. 

As block 192 indicates, the collector interval typically ends when a train has thus 
been collected. If the oldest train cannot be collected in this manner, though, the collec- 
tor proceeds to evacuate any collection-set objects referred to by references whose loca- 
tions the oldest train's scratch-pad list includes, as blocks 194 and 196 indicate. It re- 
moves them to younger cars in the oldest train, again updating references, avoiding du- 
plicate evacuations, and evacuating any collection-set objects to which the evacuated ob- 
jects refer. When this process has been completed, the collection set can be reclaimed, as 
block 198 indicates, since no remaining object is referred to from outside the collection 
set: any remaining collection-set object is unreachable. At this point the system checks if 
a fiitile cycle exists and shoxild be handled 199. If yes control reverts to itme 1 82 of FIG. 
13A where remembered set entries in scratch pads have all been processed except for the 
oldest train. If not those entires are processed. And the collector goes on to process refer- 
ences and remembered set entries as shown in Fig. 13B. If at step 199 there is not futile 
cycle needing to be handled, the system relinquishes control to the mutator 192. 

We now turn to a problem presented by popular objects. Fig. 12F shows that 
there are two references to object L after the second train is collected. So references in 
both of the referring objects need to be updated when object L is evacuated. If entry du- 
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plication is to be avoided, adding remembered-set entries is burdensome. Still, the bur- 
den in not too great in that example, since only two referring objects are involved. But 
some types of applications routinely generate objects to which there are large numbers of 
references. Evacuating a single one of these objects requires considerable reference up- 
dating, so it can be quite costly. 

One way of dealing with this problem is to place popular objects in their own 
cars. To understand how this can be done, consider Fig. 14's exemplary data structures, 
which represent the type of information a collector may maintain in support of the train 
algorithm. To emphasize trains' ordered nature, Fig. 14 depicts such a structure 244 as 
including pointers 245 and 246 to the previous and next trains, although train order could 
obviously be maintained without such a mechanism. Cars are ordered within trains, too, 
and it may be a convenient to assign numbers for this purpose explicitly and keep the 
next number to be assigned in the train-associated structure, as field 247 suggests. In any 
event, some way of associating cars with trains is necessary, and the drawing represents 
this by fields 248 and 249 that point to structures containing data for the train's first and 
last cars. 

Fig. 14 depicts one such structure 250 as including pointers 251, 252, and 253 to 
structures that contain information concerning the train to which the car belongs, the pre- 
vious car in the train, and the next car in the train. Further pointers 254 and 255 point to 
the locations in the heap at which the associated car section begins and ends, whereas 
pointer 256 points to the place at which the next object can be added to the car section. 

As discussed later with respect to Figs. 18 and 19, flags are stored in the data 
structure shown in Fig. 14 on a per car basis. A flag 259 indicates that the previous car 
has extemal references; flag 261 indicates that the present car has extemal references, 
flag 263 indicates that the cumulative number of cars in the collection set has younger 
references, and flag 265 indicates that the current collector car has younger references. 

As will be explained in more detail presently, there is a standard car-section size 
that is used for all cars that contain more than one object, and that size is great enough to 
contain a relatively large number of average-sized objects. But some objects can be too 
big for the standard size, so a car section may consist of more than one of the standard- 
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size memory sections. Structure 250 therefore includes a field 257 that indicates how 
many standard-size memory sections there are in the car section that the structure man- 
ages. 

On the other hand, that structxire may in the illustrated embodiment be associated 
not with a single car section but rather with a standard-car-section-sized memory section 
that contains more than one (special-size) car section. When an organization of this type 
is used, structures like structure 250 may include a field 258 that indicates whether the 
heap space associated with the structure is used (1) normally, as a car section that can 
contain multiple objects, or (2) specially, as a region in which objects are stored one to a 
car in a manner that will now be explained by reference to the additional structures that 
Fig. 15 illustrates. 

To deal specially with popular objects, the garbage collector may keep track of 
the number of references there are to each object in the generation being collected. Now, 
the memory space 260 allocated to an object typically begins with a header 262 that con- 
tains various housekeeping information, such as an identifier of the class to which the 
object belongs. One way to keep track of an object's popularity is for the header to in- 
clude a reference-count field 264 right in the object's header. That field's default value is 
zero, which is its value at the beginning of the remembered-set processing in a collection 
cycle in which the object belongs to the collection set. As the garbage collector processes 
the collection-set cars' remembered sets, it increments the object's reference-count field 
each time it finds a reference to that object, and it tests the resultant value to determine 
whether the count exceeds a predetermined popular-object threshold. If the count does 
exceed the threshold, the collector removes the object to a "popular side yard" if it has 
not done so already. 

Specifically, the collector consults a table 266, which points to linked lists of 
normal-car-section-sized regions intended to contain popular objects. Preferably, the 
normal car-section size is considerably larger than the 30 to 60 bytes that has been shown 
by studies to be an average object size in typical programs. Under such circimistances, it 
would be a significant waste of space to allocate a whole normal-sized car section to an 
individual object. For reasons that will become apparent below, collectors that follow the 
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teachings of the present invention tend to place popular objects into their own, single- 
object car sections. So the normal-car-section-sized regions to which table 266 points are 
to be treated as specially divided into car sections whose sizes are more appropriate to 
individual-object storage. 

To this end, table 266 includes a list of pointers to linked lists of structures associ- 
ated with respective regions of that type. Each list is associated with a different object- 
size range. For example, consider the linked list pointed to by table 266's section 
pointer 268. Pointer 268 is associated with a linked list of normal-car-sized regions or- 
ganized into n-card car sections. Structure 267 is associated with one such region and 
includes fields 270 and 272 that point to the previous and next structure in a linked list of 
such structures associated with respective regions of «-card car sections. Car-section re- 
gion 269, with which structure 267 is associated, is divided into w-card car sections such 
as section 274, which contains object 260. 

More specifically, the garbage collector determines the size of the newly popular 
object by, for instance, consulting the class structure to which one of its header entries 
points. It then determines the smallest popular-car-section size that can contain the ob- 
ject. Having thus identified the appropriate size, it follows table 266's pointer associated 
with that size to the list of structures associated with regions so divided. It follows the 
list to the first structure associated with a region that has constituent car sections left. 

Let us suppose that the first such structure is structure 267. In that case, the col- 
lector finds the next fi'ee car section by following pointer 276 to a car data structure 278. 
This data structure is similar to Fig. 14's structure 250, but in the illustrated embodiment 
it is located in the garbage-collected heap, at the end of the car section with which it is 
associated. In a structure-278 field similar to structure 250's field 279, the collector 
places the next car number of the train to which the object is to be assigned, and it places 
the train's number in a field corresponding to structure 250's field 25 1 . The collector 
also stores the object at the start of the popular-object car section in which structure 278 
is located. In short, the collector is adding a hew car to the object's train, but the associ- 
ated car section is a smaller-than-usual car section, sized to contain the newly popular 
object efficiently. 
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The aspect of the illustrated embodiment's data-structure organization that 
Figs. 14 and 15 depict provides for special-size car sections without detracting from rapid 
identification of the normal-sized car to which a given object belongs. Conventionally, 
all car sections have been the same size, because doing so facilitates rapid car identifica- 
tion. Typically, for example, the most-significant bits of the difference between the gen- 
eration's base address and an object's address are used as an offset into a car-metadata 
table, which contains pointers to car structures associated with the (necessarily uniform- 
size) memory sections associated with those most-significant bits. Figs. 14 and 15's or- 
ganization permits this general approach to be used while providing at the same time for 
special-sized car sections. The car-metadata table can be used as before to contain point- 
ers to structures associated with memory sections whose uniform size is dictated by the 
number of address bits used as an index into that table. 

In the illustrated embodiment, though, the structures pointed to by the metadata- 
table pointers contain fields exemplified by fields 258 of Fig. 14's structure 250 and 
Fig. 15's structure 267. These fields indicate whether the structure manages only a single 
car section, as structure 250 does. If so, the structure thereby found is the car structure 
for that object. Otherwise, the collector infers from the object's address and the struc- 
ture's section_size field 284 the location of the car structure, such as structure 278, that 
manages.the object's special-size car section, and it reads the object's car number from 
that structure. This inference is readily drawn if every such car structure is positioned at 
the same offset from one of its respective car section's boundaries. In the illustrated ex- 
ample, for instance, every such car section's car structure is placed at the end of the car 
section, so its train and car-number fields are known to be located at predetermined off- 
sets from the end of the car section. 
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Turning now back to the futile cycle discussed above with respect to Fig. 16 and 
referenced in Fig. 13. The criterion used in this embodiment is that it is a sufficient con- 
dition that a futile collection cycle has not been entered if the object volume contained by 
a train being collected is reduced. 

One way of measuring the volume is to count the number of cars in a train, and 
some embodiments of the present invention will employ this approach. Another pre- 
ferred approach is to add up the number of bytes occupied by all objects in all the cars in 
the train being collected. 

The collector compares the volume that the oldest train contains after collection 
with the volume contained before the collection. It then resets to zero a no-progress 
counter if the train's volume was reduced by at least a minimum amount. For example, 
progress may be the reduction of the train's volume by only one byte. If no change oc- 
curred, the no-progress counter is incremented. A threshold value for the no-progress 
coimter is established at some prior time. The threshold value may be fixed or it may be 
variable based on, for example, the train's size, or the number of cars in a train. The ra- 
tionale is that the collector should collect the oldest train's contents some number of 
times before concluding the collection is futile. Note that if the collection set includes 
several cars, the no-progress counter may be incremented by the number of cars in the 
collection set rather than by one. The no-progress counter must reach the threshold be- 
fore a futile cycle is detected. A preferred embodiment uses a threshold value of N+1 for 
an optimistic approach to breaking a futile cycle and 2N for a pessimistic approach. N is 
the niraiber of cars in the oldest train. 

The detection and solution to breaking futile collection cycles is detailed herein in 
Figs. 17, 18, 19 and 20 which are more detailed processes shown as single corresponding 
blocks in Fig. 13, and with respect to the flags 252 in Fig. 14. In each case below, where 
cars are interrogated for references in remembered sets from younger trains or external 
roots, reference is made to the field in the data structure of Fig. 14, the relevant fields 
bemg the flags 259, 261, 263, and 265. 

Fig. 17 illustrates item 175 from Fig. 13A where the collector prepares for a col- 
lection increment 300 starting with the oldest car 302. If the current car's ext-refs flag 
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261 (of Fig. 14) is true, the car's previous-ext-refs flag 259 is set true. If not, set the car's 
ext-refs 259 flag as false 304. If the current-car's younger-refs flag 265 is true, then set 
the car's has-younger-refs flag 263 as true. If not, then set the car's younger-refs flag 263 
as false 306. If there are more cars in the collection set 308, the next younger car is inter- 
rogated via blocks 304 and 306, until all the cars in the collections have been considered 
310. After this the collector returns to item 176 in Fig. 13 A, the scanning of dirtied re- 
gions. 

With respect to Fig. 13A item 179, Fig. 18 illustrated an optimistic strategy to 
break any futile cycles detected 320. If this optimistic strategy fails, a pessimistic strat- 
egy is later discussed in Fig. 19 (item 199 of Fig. 13B). Referring to Fig. 18, if the entire 
train is in the collections set 322, the no-progress counter is reset, and the threshold is set 
based on the size of the now current oldest train 324. In this optimistic process the 
threshold is set to the N+1, where N is the number of cars in the current oldest train. 
Control is returned to the collector at item 180 of Fig. 13 A. 

If the entire oldest train is not in the collection set 326, then it will be possible to 
add more cars to the collection set with possible objects that can be relocated to reduce 
the trains volume and break any detected futile cycle. First, the no-progress counter is 
interrogated 328. If the value is not greater than the threshold, control is returned to item 
1 80 in Fig. 1 3 A. If the value is greater than the threshold, then the oldest car (now the 
current car) not in the collection set from the oldest train is considered 330. The current 
car's remembered set is interrogated for references from yoimger trains 332. If there are 
such references, the current car is added to the collections set 334 and control is returned 
agam to item 180 of Fig. 13 A. If there are no references in the remembered set from 
younger trains, but there are external references into that car found on a previous collec- 
tion increment 336, then the current car is added to the collection set 334 and again con- 
trol is returned to item 180 of Fig. 13 A. In these instances, the evacuation of Aat added 
car has good prospects of breaking any detected futile cycle by reducing the volume in 
the collection set. 

If there are no external references found on a previous increment into the current 
car 338, and there are no more cars in the oldest train, control is returned to item 180 in 
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Fig. 13 A. If there are cars in the oldest train not in the collection set 340, the next 
younger car in the oldest train becomes the current car 342 and item 332 and 336/338 are 
performed again on the current car. 

If item 199 is reached in Fig. 13B, and a futile cycle is detected, it is handled pes- 
simistically as shown in Fig. 19. If the volume of the oldest train decreased indicating a 
breaking of a futile cycle, the no progress counter is reset, and the threshold is set based 
on the size of the current oldest train 350. In this case the threshold is set to 2N, twice the 
number of cars in the oldest train. If the volume has not been reduced, the no-progress 
counter is interrogated 352. If the counter is not greater than the threshold, the number of 
cars in the collection set is added to the no-progress counter 354 and control returned. 

With respect to item 179, the optimistic strategy of Fig. 18, compared to the pes- 
shnistic strategy of Fig. 19, the thresholds are different where the threshold 2N of item 
199 is equal to or greater that the N+1 threshold of item 179. Another difference is re- 
lated to the placement in the two different Figs. The optimistic strategy 179 is based on 
the inclusive information previously collected while the pessimistic strategy 199 is based 
on current information. 

If the no-progress counter has reached the threshold, the oldest car in the oldest 
train outside the collection set is designated as the current car and considered 356. If the 
current car has references in the current collector increment from younger trains the cur- 
rent car is added to the collection set 358 and control retumed as in Fig. 13. If the current 
car has no references, but does have external references 360 that reach that car, that car is 
added to the collection set 358, and control is retumed. If the current car has no external 
references, but there are more cars in the oldest trains that are not in the collection set 
362, the next younger car in the oldest trains is designated as the current car 364 and in- 
terrogated for references from yoimger trains and for external references as just dis- 
cussed. 

If there are no more cars in the oldest train 366, the oldest car in the oldest train 
outside the collection set is considered 366 as the current car. If the current car has refer- 
ences recorded in its remembered set from younger trains 368, the remembered set entries 
from younger trains are scanned 370. If the reference is from a younger train the current 
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car is added to the collection set 372 and control returned. If no references from younger 
trains are found 374, and there are no more cars in the oldest train 376, the entire oldest 
train is reclaimed, the no-progress counter is reset and thresholds are set based on the size 
of the current train 378 as above to 2N, and control returned 380. If there are more cars 
in the oldest train not in the collection set 382, the current car is set to the next youngest 
car in the oldest train 384, and process loops back and that car is interrogated for refer- 
ences from younger trams 386. 

Figs. 20A and 20B illustrate evacuation of a single object added to break a ftitile 
cycle. In this case, consider there is a futile cycle comprising cars 402 and 406 with ob- 
jects A and B reference each other and the external ROOT. The motivation is to find 
some other reachable object, in the oldest train but not in the collection set, that can be 
added to the collection set and then successftiUy evacuated to reduce the size of the col- 
lection set and therefore break the futile cycle. Turning to FIG. 13 A, assume at step 179, 
a futile cycle has been detected. During steps 180, 182, 184, and 186, some object is 
identified (typically the first such observed) in some car in the oldest train outside the 
collection- set. This object is evacuated as if its car were part of the collection-set, and 
any other references to it observed during these four steps in collecting the collection-set 
are updated. Then, all references in objects in cars outside the collection-set, but older 
than or the same as the just-evacuated object are scanned and updated to reflect its new 
location. Finally examine the remembered-set entries in its former car, and leave them 
intact, but update all remaining references in the generation to the now-relocated object. 
In this case, as illustrated in FIGS. 20A and 20B, an object Y in a yoimger car 410 in the 
oldest train is found to be reachable from a younger train. Car 410 has reference in an 
object X 412 to object Y from a younger train. The collection, shown in FIG. 20B, 
evacuates 416 object Y to a car 414 in the same train as object X. That evacuation re- 
duces the volume of the oldest train thereby breaking the futile cycle. 

What is claimed is: 



38 



